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Abstract. We derive a combinatorial identity which is useful in studying the distri- 
bution of Fourier coefficients of L-functions by allowing us to pass from knowledge of 
moments of the coefficients to the distribution of the coefficients. 

1. Introduction 

Recently M. Ram Murty and K. Sinha HMSII proved effective equidistribution results 
showing the eigenvalues of Hecke operators on the space of cusp forms of weight k 
and level agree with the Sato-Tate distribution. Their proof relied on bounding the 
discrepancy through an application of the Erdos-Turan inequality and estimates of expo- 
nential sums. In HMMI the first two authors generalized their techniques to the Fourier 
coefficients of families of elliptic curves. The purpose of this note is to describe an 
interesting combinatorial identity needed in that analysis. 

We first describe the problem that motivated this work. Recall that if E : y'^ = 
+ Ax + B with A, 5 G Z is an elliptic curve over Q, the associated L-function is 

n=l P f J 

with A = — 16(4y4'^ + 27 B^) the discriminant of E, xo the principal character modulo 
A, and 



(^e{p) = P — y) G {Z/pZy : y"^ = x^ + Ax + B mod p} 

'x^ + Ax + 5' 



- E 



X mod p 



V 



(1.2) 
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By Hasse's bound we know |a£;(j9)| < 2^, so we may write aE^p) = 2y/p cos 9e{p), 
where we may choose 9e{p) G [0,7r]. The distribution of the a£;(j9)'s are related to 
numerous problems of interest; for example, by the Birch and Swinnerton-Dyer conjec- 
ture the order of vanishing of L{E, s) at the central point s = 1/2 is conjecturally equal 
to the group of rational solutions of E. See [SiiniSil2llSTII for more on elliptic curves. 

In the analysis in HMML one needs to understand sums of cos{m9„), with n ranging 
over a family of L-functions. Such estimates exist UKallMicilNil . and have been used by 
others to prove effective equidistribution results for two-parameter families of elliptic 
curves [BSi lShlllSh2L It is possible to avoid these estimates if instead one uses results 
of Birch [Bil for sums of the moments, i.e., sums of cos''(^^„). While typically these lead 
to worse results, as there may be situations in future research where only the moments 
are known we describe how one may prove effective equidistribution results concerning 
the distribution of the Fourier coefficients of L-functions using just the moments and 
combinatorics. 

The key combinatorial ingredient in HMMH is the following, which is the main result 
of this paper. 

Theorem 1.1. Let m be an integer greater than or equal to 1. Then 



Jni\fm + r\ 1 _ J 1/2 if m = 1 

\r)\ r ){r + l){m + r) |o if m > 2. 

We prove this theorem in ^ and then discuss its application to effective equidistri- 
bution in ^ 

2. Combinatorial Identities 

Below we give two different proofs of Theorem 11.11 each highlighting a different 
approach to proving combinatorial identities. We first state some needed properties of 
the binomial coefficients. For n,r non-negative integers we set (^) = ^T(^^3^. We 
generalize to real n and k a positive integer by setting 

'n\ ^ n{n-l)---{n-{k-l)) 
Jij k\ ' 

which clearly agrees with our original definition for n a positive integer and vanishes 
when n is a non-negative integer less than k. Finally, we set (p) = 1 and (^) = if A; is 
a negative integer. 

To prove our main result we need the following two lemmas; we follow the proofs in 
[ITOdll . 

Lemma 2.1 (Vandermonde's Convolution Lemma). Let r, s be any two real numbers 
and k, m, n integers. Then 

r \ / s \ / r + s 



\m + k J \n — k J \m + n 



(2.2) 
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Proof. Note that the summand is zero if either m + k>roxn — k>s, and thus it is a 
finite sum over k. It suffices to prove the claim when r, s are integers. The reason is that 
both sides are polynomials, and if the polynomials agree for an infinitude of integers 
then they must be identical. By changing n and k, we see it suffices to consider the 
special case m = 0, in which case we are reduced to showing 



E 



r \ I s \ /r + s 
k j \ n — k j \ n 



(2.3) 



Consider the polynomial 



If we use the binomial theorem to expand the left hand side of (12.41) . we get the coef- 
ficient of the x"?/''^'^^" is the left hand side of (|2.3I) : this follows from looking at all 
the ways we could get an x"?/''^'^"", which involves summing over the coefficients of 
rf.kyr-k tijQgs i\yq coefficients of x'^~'^y^~'^^'^ . Similarly, if we use the binomial theorem 



we find the coefficient of x^y"^^"^ " is the right hand side of (12.41) . This proves (|2.3I) . 
which completes the proof. □ 

Lemma 2.2. Let i, m, s be non-negative integers. Then 

i:(-)'(J.)Cr) 

Proof. Using (°) = J , we rewrite (''^'') as {^"^^tr) , and we then rewrite (^t-n) 
(— 1)'*+'^^" (^T^'^^l^) by using the extension of the binomial coefficient, where we have 
pulled out all the negative signs in the numerators. The advantage of this simplification 
is that the summation index is now only in the denominator; further, the power of —1 is 
now independent of k. Factoring out the sign, our quantity is equivalent to 

^ \ m + kj \s + k - nj ^ ' ^ \(. - m - k ) \s + k - n) ' 

(2.6) 

where we again use (^) = {^__^) ■ By Vandermonde's Convolution Lemma, this equals 

(-1)^^" L-'Z~nJ- Using G_r-:+J = (7-7) and collecting powers of -1 completes 
the proof (note (-1)^-™ = (-1)^+™). □ 



Using the above two lemmas, we can now prove our main result. 
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First Proof of Theorem U .1\ The case m = 1 follows by direct evaluation. Consider 
now m > 2. We have 



r=0 
m 

= B-i) 

r=0 
m 

r=0 
m 

= B-i) 



m \ f m + r 



^ J{r + l){m + r) 
m\ m + 1 / m + r\ 1 



r / m + 1 \ r / (r + l)(m + r) 
m!(m + l) 1 (m + r)(m + r — 1)! 1 



r + 1) • r!m! m + 1 r\m ■ [m — 1 + r)\ m + r 
m + l\/m — l + r\ 1 



r=0 



r + 1 J \ r J m{m + 1) 



We change variables and set m = r + 1; as r runs from to m, m runs from 1 to m + 1. 
To have a complete sum, we want u to start at 0; thus we add in the u = term, which 
is . As m > 2, this is from the extension of the binomial coefficient (this is the 
first of two places where we use m > 2). Our sum Sm thus equals 

s™ = -^^i:(-irf'"^')f'""'r). (2.8) 

m(m + 1) \ u /\m— 1 / 

We now use Lemma [Z2] with A; = u, m = 0, £ = m + l,s = m — 2 and n = m — 1; note 
the conditions of that lemma require s to be a non-negative integer, which translates to 
our m > 2. We thus find 

1 _l)-+i/^^-2\ ^ ^2.9) 



m(m + 1) \ —2 

which completes the proof. □ 

We give another proof of Theorem 11.11 below using hypergeometric functions, high- 
lighting other approaches to proving combinatorial identities. 

Second Proof of Theorem [777] Consider the hypergeometric function 

The following identity for the normalization constant of the Beta function is crucial in 
the expansions: 

B(..,)^/*-(i-*rv*^™. (2.11) 
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We can use the geometric series formula to expand (|2.10|) as a power series in z involv- 
ing Gamma factors, 

1 (a)l (o) 1 (c + n) n\ 

Rewriting (7) as {-XJ {^-''^-^), S„, can be written 

g ^ 1 (r-m-l)!(r + m- 1)! 1 

m!(— m — 1)! (r + 1)! r!' 

where we have formally extended the series to 00 as the coefficients will vanish for 
r > m + 1. By comparing the two infinite series and using the fact that z\ = T(z + 1), 
we see that if we take a = —m, b = m, c = 2, n = r and z = 1, after some simple 
algebra we obtain 

c. ^ r(m)2Fi(-m,m,2; 1) ^ r{m) 

r(2)r(l + m) r(l + m)r(2 + m)r(2-m)' ^' ^ 

where the last step uses 

.F,(a,6,c;l) = ^i^^^''." ° " . (2.15) 

r(c — a)T{c — b) 

which follows from the normalization constant of the Beta function. Note that the 
right hand side of (12.141) is 1/2 when m = 1 and for m > 2 because for such m, 
l/r(2-m) = due to the pole of r (2- m). 

□ 

Remark 2.3. It is also possible to prove Theorem U . 1 1 through symbolic manipulations. 
Using the results from HPSi IPSRH . one may input this into a Mathematica package, 
which outputs a proof (though not all the steps). The reasoning behind this automated 
proof method is described in nPWZQ . and many of the identities for hypergeometric 
functions can be interpreted in a very computational manner. These results are also 
useful in random walk calculations in physics (quantum and classical), and reduction 
to the hypergeometric function is a convenient first step towards continuum limits or 
long-time asymptotics. 

3. Effective Equidistribution 
For a sequence of numbers x„ modulo 1, a measure ^ and an interval / C [0, 1], let 

^{I) = / fi{t)dt. (3.1) 



The discrepancy Djy^ (/i) is 

Djy^if,) = \NjiVp)-VpixiI)\; (3.2) 
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with this normalization, the goal is to obtain the best possible estimate for how rapidly 
Diy^{jj) /Vp tends to 0. A standard approach is to use exponential sums and the Erdos- 
Turan theorem. Modifying the ideas in [MSj (see JMMJ for the details), one finds 

Theorem 3.1. Let {xn} C [0, 1] and let the notation be as above. Let {cm} be a 
sequence of numbers such that ^^^=-00 < 00. Let ||/i|| = sup^,gjQ y |-F(2;)| with 
= F{—x)dx. Then for any Vp and M the discrepancy satisfies 



^'^"^^^-M + l ^ \M+1 V 

l<m<M ^ ^ 



^e(mx„,) - VpCm 

n=l 



(3.3) 

Let fist = F{—x)dx be the normalized Sato-Tate distribution on [0, 1]. Its density is 

2 sin^inx) = + e(-x)) , (3.4) 

which implies that the coefficients of /ist are cq = 1, c±i = —1/2 and = for 
\m\ > 2. 

We consider the family of all elliptic curves modulo p for p >h. We may write these 
curves in Weierstrass form a& y"^ = x^ — ax — h with a, 6 G Z/pZ and Ao? 7^ 276^. The 
number of pairs (a, h) satisfying these conditionals 

Vp ■= Pip -I). (3.5) 



We use Birch's [lBi| results on the moments of the family of all elliptic curves modulo p 
(there are some typos in his explicit formulas; we correct these in [|MMl); unfortunately, 
these are results for quantities such as (2y/pcos 6'„)^^, and the quantity which naturally 
arises when applying Theorem 13.11 is e(mx„) (with x„ running over the normalized 
angles Oa,bip)/'^), specifically 



y] e{mxn) - VpCm 

n=l 



(3.6) 



By applying some combinatorial identities we are able to rewrite our sum in terms of 
the moments, which allows us to use Birch's results. The point of this section is not to 
obtain the best possible error term but rather to highlight how one may generalize and 
apply the framework from [MS|. 

We first set some notation. Let o-^iTp) denote the trace of the Hecke operator Tp 
acting on the space of cusp forms of dimension —2k on the full modular group. We 
have ak+iiTp) = 0{p^^^^''), where from [Sel] we see we may take c = 3/4 (there 
is no need to use the optimal c, as our final result, namely (13.231) . will yield the same 
order of magnitude result for c = 3/4orc = 0). Let M.p{2R) denote the 2i?* moment 



^If a = then the only b which is eliminated is 6 = 0. If a is a non-zero perfect square there are two 
b that fail, while if a is not a square than no b fail. Thus the number of bad pairs of (a, b) is p. 
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of 2 cos(6'„) = 2 cos(7rx„) (as we are concerned with the normalized values, we use 
slightly different notation than in llBill '): 

Mp{2R) = ^(2cos(7rx„))'''. (3.7) 

^ 71=1 

Lemma 3.2 (Birch). Notation as above, we have 

M,{2R) = -i- (^^) + O (2^^Vi'^) ; (3.8) 



R+1\R 

we may take c = 3/4 and thus there is a power saving^ 

Proof. The result follows from dividing the equation for S'^(p) on the bottom of page 59 
of ['BI'I by p^, as we are looking at the moments of the normalized Fourier coefficients 
of the elliptic curves, and then using the bound (jfe+i (Tp) = 0{p^^'^^^), with c = 3/4 
admissible by nSelH . Recall Vp = p{p — 1) is the cardinality of the family. We have 

1 f2R\ p{p - 1) 



Mpi2R) 



R + 1\RJ Vp 



^ 2k + l f 2R p 



'2R 



R + k+l\R + kJ Vp pW^ 



+ 0(2^^Vp ' ) (3.9) 



R + 1\R 

since = p(p — 1). □ 

A simple argumen{l shows that the normalized angles are symmetric about 1/2. This 
implies 

Vp Vp Vp Vp 

e{mxn) = cos(27rma:„) + i sin(27rmx„) = cos(2m6'„), (3.10) 

n=l n=l n=l n=l 

where the sine piece does not contribute as the angles are symmetric about 1/2. Thus it 
suffices to show we have a power saving in 



cos{2m6n) — Vpi 



n=l 



(3.11) 



Note -j^^ (^^) is the i?* Catalan number. The Catalan numbers are the moments of the semi-circle 
distribution, which is related to the Sato-Tate distribution by a simple change of variables. 

To see that we may match the angles as claimed for the family of all elliptic curves, consider the 
elliptic curve = — ax — b with 4a^ 7^ 276^. Let c be any non-residue modulo p, and consider 
the curve ip ~ x^ — ac^x — bc^. Using the Legendre sum expressions for aE{p) and aE'{p), using 
the automorphism a; — ex we see the second equals (^) times the first; as we have chosen c to be a 

non-residue, this means 2y/pcos{9E'{p)) = — 2y^cos(0£;(p)), or 6e'{p) = vr — 9e{p) as claimed. 
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By symmetry, it suffices to consider m > 0. 



Lemma 3.3. Let Cq = 1, c±i = — 1/2 and Cr, 
that 



otherwise. There is some c < 1 such 



COs{2m9n) — VpCn 



n=l 



(3.12) 



by the work ofSelberg nSelH we may take c = 3/4. 



Proof. The case m = is trivial. For m = 1 we use the trigonometric identity 

cos(26'n) = 2cos^((^n) " 1- As c±i = —1/2 we have 



^cos(2^„)-^ = 



n=l 



n=l 



(2 COS^ On - 1) 



-^((2cose„)2-l) 

n=l 

l^^(2Vpcos^„)2 
9 2^ 



n=l 



(3.13) 



Note the sum of (2y^cos is the second moment of the number of solutions modulo 
p. From [Bi] we have that this is p + 0(1); the explicit formula given in [IBH for the 
second moment is wrong; see [,MM I for the correct statement. Substituting yields 



Vr, 



^cos(20„)-f 



n=l 



« 0(1). 



(3.14) 



The proof is completed by showing that cos(2m6'n) = OmiVp^'^) provided 

2 < m < M. In order to obtain the best possible results, it is important to understand 
the implied constants, as M will have to grow with Vp (which is of size p^). While it is 
possible to analyze this sum for any m by brute force, we must have M growing with 
p, and thus we need an argument that works in general. As c±i ^ but = for 
\m\ > 2, we expect (and we will see) that the argument below does break down when 
\m\ = 1. 

There are many possible combinatorial identities we can use to express cos(2m6'„) 
in terms of powers of cos(6'n). We use the following (for a proof, see Definition 2 and 
equation (3.1) of tMll): 



2cos(2m6'„) = ^ C2m,2r(2 cos6'„; 



2r 



(3.15) 



r=0 
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where C2r = (2r)!/2, co,o = 0, C2m,o = (— 1)™'2 for m > 1, and for 1 < r < m set 

C2m,2r = \ \{m -j) = . (3.16) 

C2r jj-^ C2r (m-rj! 

We now sum (13.151) over n and divide by Vp, the cardinality of the family. In the 
argument below, at one point we replace 2^*^ in an error term with 2012-jy {^.^. ) ■ tti^', this 
allows us to pull the r"' Catalan number, (^^^ , out of the error termO Using Lemma 
[Owe find 

— 2 COs(2m6'„) = ^C2m,2rTT^i'2cOs9nY'' 

n=l r=0 n=l 



r=0 



E 



r + l\r 
1 (2r)! (-l)'"+^'2 m-(m + r)! 



r + 1 r!r! (2r)! (m — r)!-(m + r) 
(^1 + (mVp~^)) 



-ir2m5^((-l) 

r=0 ^ 



m\ (m + r)! 



r!(m — r)! m!r! (r + l)(m + r) 



;-ir2..5:((-ir(7)( 



m \ (m + r 



r=0 

l-C-E 



r y (r + l)(m + r) 



1 + mVp 2 . (3.17) 



We first bound the error term. For our range of r, ("^^'^) < (^™) < 2^"^. The sum of 
(") over r is 2"^, and we get to divide by at least m + r > m. Thus the error term is 
bounded by 



l-c-e 



O (^m'^2^"'Vp ^ j . (3.18) 
We now turn to the main term. It it just (— l)'"2m times the sum in Theorem 1 1.1[ which 



is shown in that theorem to equal for any \m\ > 2. Note that without Theorem ll.il 
our combinatorial expansion would be useless. □ 



^The reason this is valid is that the largest binomial coefficient is the middle (or the middle two when 
the upper ai-gumentis odd). Thus 2^'' = (1 + if'' < (2r + < 2{m + (as m < r), and the 

claim follows from > 2(m + 1) for m > 2 and < r < to. 
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Remark 3.4. It is possible to get a better estimate for the error term by a more de- 
tailed analysis of X]r<m (T) (™^'^) >' however, the improved estimates only change the 
constants in the discrepancy estimates, and not the savings. This is because this sum is 
at least as large as the term when r ^ m/2, and this term contributes something of the 
order 3'^™/^/m by Stirling's formula. We will see that any error term of size 3"™ /or a 
fixed a gives roughly the same value for the best cutoff choice for M, differing only by 
constants. Thus we do not bother giving a more detailed analysis to optimize the error 
here. 

We now prove effective equidistribution for the family of all elliptic curves. 
Theorem 3.5. For the family of all elliptic curves modulo p, as p ^ oo we have 

Di, v^ifist) < C-\ (3.19) 
log Vp 

for some computable C. 

Proof. We must determine the optimal M to use in (13.31) : 



as 



l<m<M 

« ^ + M2'''Vi'^ 

(3.20) 

YTTT -C — and V ^ 2^™ <^ 2^*^^. For all c > we find the minimum error by 
setting the two terms equal to each other, which yields 

= M^2^^^. (3.21) 

For ease of exposition we replace M^2^*^ with e^*^; this worsens our constant slightly, 
but does not qualitatively change the result. Equating these errors means we are looking 
for M such that 

which implies 

M = ^'""'' logVp. (3.23) 
o 

We thus see that we may find a constant C such that 

Di, v^i^ist) < C-\. (3.24) 

This yields a logarithm savings in the discrepancy, and proves effective equidistribution. 

□ 
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